Cycle-Accurate Microarchitecture Performance Profiling

نویسندگان

  • Richard Hough
  • Phillip Jones
  • Scott Friedman
  • Roger Chamberlain
  • John Lockwood
  • Ron K. Cytron
چکیده

We present the design, implementation, and evaluation of a circuit we call the Statistics Module that captures cycle-accurate performance profiling at (or above) the microarchitecture layer. The circuit is deployed using an FPGA in the context of a soft-core characterization of a SPARC architecture (LEON). Accessible over the Internet, the circuit can be dynamically configured to capture program-method-level statistics of any subset of predefined VHDL signals. The circuit is deployed outside the actual soft core, so that its operation does not interfere with a program’s execution at any level. In contrast with simulations, entire program executions can be captured, and the resulting statistics include all runtime artifacts, such as multithreading, operating system support, and external interrupts. In contrast with software-introduced instrumentation, the measurements do not affect the statistics, and microarchitecture characteristics are easily captured. Our design avoids the otherwise combinatorial size of circuitry that would be required to accommodate all methods and events, scaling well with the number of artifacts that are actually measured. We have used this circuit to measure cycle-accurate cache-RAM statistics, such as cache hits and misses, RAM reads and writes, using both write-through and write-back policies. In this paper, we show the scalabilty of our design as it accommodates more methods and events. This work was sponsored by the National Science Foundation under grant ITR–0313203.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semi-Automatic Microarchitecture Configuration of Soft-Core Systems

The major goal of our project is to measure and improve application performance, by providing an easily and efficiently reconfigurable architecture along with software support to expedite its use. Our FPGA-based system is implemented as an extensible hardware module on the Field-programmable Port Extender (FPX) platform at Washington University. The FPX platform provides an environment where a ...

متن کامل

An Evaluation of Stratified Sampling of Microarchitecture Simulations

Recent research advocates applying sampling to accelerate microarchitecture simulation. Simple random sampling offers accurate performance estimates (with a high quantifiable confidence) by taking a large number (e.g., 10,000) of short performance measurements over the full length of a benchmark. Simple random sampling does not exploit the often repetitive behaviors of benchmarks, collecting ma...

متن کامل

Stallscope: Illuminating the Black Box

As microprocessors become increasingly more complex, cycle-accurate simulation has become a valuable tool for performance analysis and microarchitectural exploration. However, parallelism, complex interdependencies, and deep pipelining in modern superscalar processors make it difficult to identify how a particular microarchitectural design feature ultimately affects performance, particularly in...

متن کامل

Microarchitecture Level Interconnect Modeling Considering Layout Optimization

— In this paper, we study microarchitecture-level interconnect modeling for power and performance. Considering structural interconnects, layer assignment, and concurrent repeater and Flip-Flop (FF) insertion, we develop cycle-accurate microarchitecture-level power and throughput simulation and obtain an accurate modeling of interconnects at the early design stage. Experiment show that the simul...

متن کامل

Cycle-Accurate Microarchitecture Performance Evaluation

We present the design, implementation, and evaluation of a circuit we call the Statistics Module that captures cycle-accurate performance data at (or above) the microarchitecture layer. The circuit is deployed introspectively—in the architecture itself— using an FPGA in the context of a soft-core implementation of a SPARC architecture (LEON). Accessible over the Internet, the circuit can be dyn...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005